67 research outputs found

    An improved, high-quality draft genome sequence of the Germination-Arrest Factor-producing Pseudomonas fluorescens WH6

    Get PDF
    <p>Abstract</p> <p>Background</p> <p><it>Pseudomonas fluorescens </it>is a genetically and physiologically diverse species of bacteria present in many habitats and in association with plants. This species of bacteria produces a large array of secondary metabolites with potential as natural products. <it>P. fluorescens </it>isolate WH6 produces Germination-Arrest Factor (GAF), a predicted small peptide or amino acid analog with herbicidal activity that specifically inhibits germination of seeds of graminaceous species.</p> <p>Results</p> <p>We used a hybrid next-generation sequencing approach to develop a high-quality draft genome sequence for <it>P. fluorescens </it>WH6. We employed automated, manual, and experimental methods to further improve the draft genome sequence. From this assembly of 6.27 megabases, we predicted 5876 genes, of which 3115 were core to <it>P. fluorescens </it>and 1567 were unique to WH6. Comparative genomic studies of WH6 revealed high similarity in synteny and orthology of genes with <it>P. fluorescens </it>SBW25. A phylogenomic study also placed WH6 in the same lineage as SBW25. In a previous non-saturating mutagenesis screen we identified two genes necessary for GAF activity in WH6. Mapping of their flanking sequences revealed genes that encode a candidate anti-sigma factor and an aminotransferase. Finally, we discovered several candidate virulence and host-association mechanisms, one of which appears to be a complete type III secretion system.</p> <p>Conclusions</p> <p>The improved high-quality draft genome sequence of WH6 contributes towards resolving the <it>P. fluorescens </it>species, providing additional impetus for establishing two separate lineages in <it>P. fluorescens</it>. Despite the high levels of orthology and synteny to SBW25, WH6 still had a substantial number of unique genes and represents another source for the discovery of genes with implications in affecting plant growth and health. Two genes are demonstrably necessary for GAF and further characterization of their proteins is important for developing natural products as control measure against grassy weeds. Finally, WH6 is the first isolate of <it>P. fluorescens </it>reported to encode a complete T3SS. This gives us the opportunity to explore the role of what has traditionally been thought of as a virulence mechanism for non-pathogenic interactions with plants.</p

    Microbial Community Structure and Functional Potential Along a Hypersaline Gradient

    Get PDF
    Salinity is one of the strongest environmental drivers of microbial evolution and community composition. Here we aimed to determine the impact of salt concentrations (2.5, 7.5, and 33.2%) on the microbial community structure of reclaimed saltern ponds near San Francisco, California, and to discover prospective enzymes with potential biotechnological applications. Community compositions were determined by 16S rRNA amplicon sequencing revealing both higher richness and evenness in the pond sediments compared to the water columns. Co-occurrence network analysis additionally uncovered the presence of microbial seed bank communities, potentially primed to respond to rapid changes in salinity. In addition, functional annotation of shotgun metagenomic DNA showed different capabilities if the microbial communities at different salinities for methanogenesis, amino acid metabolism, and carbohydrate-active enzymes. There was an overall shift with increasing salinity in the functional potential for starch degradation, and a decrease in degradation of cellulose and other oligosaccharides. Further, many carbohydrate-active enzymes identified have acidic isoelectric points that have potential biotechnological applications, including deconstruction of biofuel feedstocks under high ionic conditions. Metagenome-assembled genomes (MAGs) of individual halotolerant and halophilic microbes were binned revealing a variety of carbohydrate-degrading potential of individual pond inhabitants

    Metagenomics reveals sediment microbial community response to Deepwater Horizon oil spill

    Get PDF
    The Deepwater Horizon (DWH) oil spill in the spring of 2010 resulted in an input of ∼4.1 million barrels of oil to the Gulf of Mexico; >22% of this oil is unaccounted for, with unknown environmental consequences. Here we investigated the impact of oil deposition on microbial communities in surface sediments collected at 64 sites by targeted sequencing of 16S rRNA genes, shotgun metagenomic sequencing of 14 of these samples and mineralization experiments using (14)C-labeled model substrates. The 16S rRNA gene data indicated that the most heavily oil-impacted sediments were enriched in an uncultured Gammaproteobacterium and a Colwellia species, both of which were highly similar to sequences in the DWH deep-sea hydrocarbon plume. The primary drivers in structuring the microbial community were nitrogen and hydrocarbons. Annotation of unassembled metagenomic data revealed the most abundant hydrocarbon degradation pathway encoded genes involved in degrading aliphatic and simple aromatics via butane monooxygenase. The activity of key hydrocarbon degradation pathways by sediment microbes was confirmed by determining the mineralization of (14)C-labeled model substrates in the following order: propylene glycol, dodecane, toluene and phenanthrene. Further, analysis of metagenomic sequence data revealed an increase in abundance of genes involved in denitrification pathways in samples that exceeded the Environmental Protection Agency (EPA)'s benchmarks for polycyclic aromatic hydrocarbons (PAHs) compared with those that did not. Importantly, these data demonstrate that the indigenous sediment microbiota contributed an important ecosystem service for remediation of oil in the Gulf. However, PAHs were more recalcitrant to degradation, and their persistence could have deleterious impacts on the sediment ecosystem

    GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences

    Get PDF
    GENE-counter is a complete Perl-based computational pipeline for analyzing RNA-Sequencing (RNA-Seq) data for differential gene expression. In addition to its use in studying transcriptomes of eukaryotic model organisms, GENE-counter is applicable for prokaryotes and non-model organisms without an available genome reference sequence. For alignments, GENE-counter is configured for CASHX, Bowtie, and BWA, but an end user can use any Sequence Alignment/Map (SAM)-compliant program of preference. To analyze data for differential gene expression, GENE-counter can be run with any one of three statistics packages that are based on variations of the negative binomial distribution. The default method is a new and simple statistical test we developed based on an over-parameterized version of the negative binomial distribution. GENE-counter also includes three different methods for assessing differentially expressed features for enriched gene ontology (GO) terms. Results are transparent and data are systematically stored in a MySQL relational database to facilitate additional analyses as well as quality assessment. We used next generation sequencing to generate a small-scale RNA-Seq dataset derived from the heavily studied defense response of Arabidopsis thaliana and used GENE-counter to process the data. Collectively, the support from analysis of microarrays as well as the observed and substantial overlap in results from each of the three statistics packages demonstrates that GENE-counter is well suited for handling the unique characteristics of small sample sizes and high variability in gene counts

    The genetic architecture of the human cerebral cortex

    Get PDF
    The cerebral cortex underlies our complex cognitive capabilities, yet little is known about the specific genetic loci that influence human cortical structure. To identify genetic variants that affect cortical structure, we conducted a genome-wide association meta-analysis of brain magnetic resonance imaging data from 51,665 individuals. We analyzed the surface area and average thickness of the whole cortex and 34 regions with known functional specializations. We identified 199 significant loci and found significant enrichment for loci influencing total surface area within regulatory elements that are active during prenatal cortical development, supporting the radial unit hypothesis. Loci that affect regional surface area cluster near genes in Wnt signaling pathways, which influence progenitor expansion and areal identity. Variation in cortical structure is genetically correlated with cognitive function, Parkinson's disease, insomnia, depression, neuroticism, and attention deficit hyperactivity disorder

    Dissecting the Shared Genetic Architecture of Suicide Attempt, Psychiatric Disorders, and Known Risk Factors

    Get PDF
    Background Suicide is a leading cause of death worldwide, and nonfatal suicide attempts, which occur far more frequently, are a major source of disability and social and economic burden. Both have substantial genetic etiology, which is partially shared and partially distinct from that of related psychiatric disorders. Methods We conducted a genome-wide association study (GWAS) of 29,782 suicide attempt (SA) cases and 519,961 controls in the International Suicide Genetics Consortium (ISGC). The GWAS of SA was conditioned on psychiatric disorders using GWAS summary statistics via multitrait-based conditional and joint analysis, to remove genetic effects on SA mediated by psychiatric disorders. We investigated the shared and divergent genetic architectures of SA, psychiatric disorders, and other known risk factors. Results Two loci reached genome-wide significance for SA: the major histocompatibility complex and an intergenic locus on chromosome 7, the latter of which remained associated with SA after conditioning on psychiatric disorders and replicated in an independent cohort from the Million Veteran Program. This locus has been implicated in risk-taking behavior, smoking, and insomnia. SA showed strong genetic correlation with psychiatric disorders, particularly major depression, and also with smoking, pain, risk-taking behavior, sleep disturbances, lower educational attainment, reproductive traits, lower socioeconomic status, and poorer general health. After conditioning on psychiatric disorders, the genetic correlations between SA and psychiatric disorders decreased, whereas those with nonpsychiatric traits remained largely unchanged. Conclusions Our results identify a risk locus that contributes more strongly to SA than other phenotypes and suggest a shared underlying biology between SA and known risk factors that is not mediated by psychiatric disorders.Peer reviewe

    Multiple Mutations Associated with Emergent Variants Can Be Detected as Low-Frequency Mutations in Early SARS-CoV-2 Pandemic Clinical Samples

    No full text
    Genetic analysis of intra-host viral populations provides unique insight into pre-emergent mutations that may contribute to the genotype of future variants. Clinical samples positive for SARS-CoV-2 collected in California during the first months of the pandemic were sequenced to define the dynamics of mutation emergence as the virus became established in the state. Deep sequencing of 90 nasopharyngeal samples showed that many mutations associated with the establishment of SARS-CoV-2 globally were present at varying frequencies in a majority of the samples, even those collected as the virus was first detected in the US. A subset of mutations that emerged months later in consensus sequences were detected as subconsensus members of intra-host populations. Spike mutations P681H, H655Y, and V1104L were detected prior to emergence in variant genotypes, mutations were detected at multiple positions within the furin cleavage site, and pre-emergent mutations were identified in the nucleocapsid and the envelope genes. Because many of the samples had a very high depth of coverage, a bioinformatics pipeline, &ldquo;Mappgene&rdquo;, was established that uses both iVar and LoFreq variant calling to enable identification of very low-frequency variants. This enabled detection of a spike protein deletion present in many samples at low frequency and associated with a variant of concern
    corecore